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Preface 

The  subject  of  this  report  in  its  broadest  aspects,  has  been  the  topic  of  mam 
conferences  between  personnel  of  the  Design  Climatology  Branch,  A KG  I..;  its 
contractor,  St.  Louis  University:  and  the  Air  Weather  Service.  The  latter  had 
expressed  requirements,  since  1960,  for  conditional  climatology  for  military- 
operations  aimed  at  short-range  predictions  of  I to  6 hours  in  advance,  and  mote 
recently  for  military  planning  of  worldwide  Air  Force  missions.  This  writer’s 
discussions  with  C'apt.  Albert  K.  Boehm,  AWS  DNT.  have  been  most  valuable. 
Capt.  Boehm  used  an  alternative  method  to  obtain. the  model  algorithm  front  the 
basic  equations  of  T.  W.  Anderson.  His  knowledge  and  expertise  in  the  statistical 

literature,  freely  shared,  are  gratefully  acknowledged. 
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Multi-Predictor  Conditional  Probabilities 

1.  I YI'ROIH  C.TION 


Beginning  with  the  climatic  frequency  of  a weather  condition  or  event,  it  is 
possible  to  estimate  the  modified  orobability  of  its  occurrence,  or  to  give  its 
conditional  probability  following  certain  antecedents.  Symbolically,  if  Y is  the 
predictand,  and  Xj, , Xn  are  n predictors,  then  it  is  desired  to  find 

P (Y  £ Yc  | Xj,  — , Xn) 

where  Yc  is  a threshold  value  of  the  predictand,  and  P is  the  symbol  for 
probability. 

It  need  not  be  assumed  that  Y and  X,  (i  - 1,  n)  are  d scribed  in  categories 
only.  Instead,  in  this  report,  all  weather  elements  are  assumed  transformable 
into  continuous  variables.  If  X or  Y is  transformed  into  a normal  Gaussian  vari- 
able, then  the  new  variable,  x or  y,  is  conveniently  called  the  transnormalized 
variable,  a term  first  suggested  to  this  writer  by  Capt.  Albert  Boehm,  Air 

Weather  Service.  The  predictors,  Xj, , Xn  and  predictand  Y are  considered 

specific  values  such  as  temperature  (for  example,  12°C)  or  cloud  cover  (for 

example,  3/10).  Each  transnormalized  counterpart,  xj, , xn  or  y has  mean 

zero  and  variance  1. 0 and  Gaussian  distribution. 


(Received  for  publication  5 October  1976) 
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While  often  times  there  are  analytical  formulas  for  the  transformation  of 
X to  x,  it  is  always  possible  to  do  this  transformation  and  obtain  a one-to-one 
correspondence,  through  the  cumulative  distribution  of  X or  by  plotting  X on 
normal  probability  paper1.  Essentially,  the  transnormalization  is  accomplished 
through  the  probabilities.  Thus: 

P(x  > xc)  P (X  > Xc) 

where  the  subscript  (c)  is  used  to  denote  an  assigned  threshold  value.  In  this 
report,  for  convenience,  the  symbol  for  probability  is  sometimes  written  as  P(?) 
to  denote  the  probability  of  equalling  or  exceeding  the  bracketed  quanPty.  Thus, 
Eq.  (1)  is  also  written  as 

P(xc)  P(XC)  . 

A difficulty  arises  when  the  minimum  of  X must  be  categorized,  such  as  "no 
rain".  When  this  happens,  arbitrarily,  half  of  the  relative  frequency  of  no  rain 
(XQ)  can  be  added  to  the  frequency  of  rain  to  give  P (X  > XQ). 

2.  mPOTIIKSIS 

The  analytical  model  for  conditional  probability,  or  algorithm,  is  based  on 
the  assumption  of  linear  dependence  of  the  transnormalized  predictand  on  trans- 
normalized  predictors.  Thus: 

y = ajXj  + ---  + anxn  +bn  (2‘ 

where  17  is  random  and  normally  distributed,  alf , an  are  partial  regression 

coefficients,  and  b is  a coefficient  with  magnitude  such  that  the  normality  of  v >s 
preserved. 

3.  mi.YSIS 

Because  of  normality,  correlation  coefficients  (hereafter  abbreviated  as  cc) 
are  given  by: 


1.  Gringorten,  1. 1.  (1972)  Conditional  probability  for  an  exact  noncategorized 
initial  condition.  Monthly  Weather  Review  100:796-798. 
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where  p.^  is  the  symbol  for  cc  between  predictors  Xj  and  x.,  and  p.  is  the  symbol 
for  cc  between  predictand  (y)  and  the  it h predictor  (Xj). 

From  Eq.  (2),  after  squaring  both  sides  of  the  equation  and  taking  expected 
values, 


By  multiplying  both  sides  of  (2)  by  x-  and  taking  expected  values 


then  Eq.  (4)  becomes  R CA. 

By  premultiplying  both  sides  of  this  last  equation  by  the  inverse  of  C 


where  |c|  is  the  determinant  corresponding  to  C;  C is  the  adjoint  matrix  of  C,  or 
the  matrix  of  cofactors  (p . );  p..  (-1)*  | M.j  | where  |m^|  is  the  determinant 

of  the  submatrix  of  order  (n-1)  obtained  by  deleting  the  ith  row  and  jth  column 
from  C.  “ 

From  Eq.  (3) 

b2 3  = 1 - | atca|  =1-|  atr| 


or 


b Vl-ajPj-...  - anpn  . (9) 

From  Eq.  (2)  for  specific  values  of  x^, , xn  the  value  of  r)  will  exceed  r)c  as 

frequently  as  y exceeds  an  assigned  minimum  yc  or 

P (r)  > nc)  = P (y  > yc  | xy  ---  , xn>  . (10) 

Or,  the  probability  that  r)  will  exceed  nc  is  the  conditional  probability  that  y will 
exceed  y . Its  solution,  therefore,  is  given  by  the  solution  of  nc  in 

rj  = (y  - a, x.  - ...  - a x ) /b  (11) 

'c  Jc  11  n n ' 

where  a;  (i  - 1,  n)  and  b are  given  by  Eqs.  (8)  and  (9)  supported  by  Eqs.  (a)  and  (6). 
Since  r)  has  a Gaussian  distribution,  the  probability  of  it  exceeding  n is  given  in 
tables  of  almost  any  text  in  statistics  or  approximately  by  algorithm  (see  below). 
Capt.  Albert  R.  Boehm,  in  personal  correspondence,  revealed  that  the 

3 

results  in  Eqs.  (10)  and  (11)  are  obtainable  from  the  deductions  of  T.  VV.  Anderson. 
Anderson's  theorem  bears  promise  of  further  enhancement  of  the  results  by  giving 
solutions  for  the  conditional  probability  of  several  predictands  in  combination,  as 
well  as  predictors  in  combination.  This  possibility,  however,  has  not  yet  been 
examined. 


2.  Hohn,  Franz  E.  (1964)  Elementary  Matrix  Algebra,  The  MacMillan  Company, 

New  York. 

3.  Anderson,  T W.  (1958)  An  Introduction  to  Multivariate  Statistical  Analysis, 

John  Wiley  and  Sons,  New  York. 
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3.1  Single  Predictor 


For  a single  predictor  R p,  C l.  and  consequently  a 


whence 


a result  previously  obtained 


3.2  Two  Predictors 


In  this  case,  the  solution  for  n in  Eq.  (Ill  becomes 


where 


In  terms  of  the  cc's  (p  and  „ I.  the  expressions  for  a,,  a become  too 
• ij  in 

long  and  complicated  for  hand-written  analysis.  But  solutions  are  feasible  if  a 
computer  technique  will  provide  numerical  answers  for  A in  Eq.  (8),  using 
Eqs.  (5)  and  (6)  as  input.  One  such  computer  program  was  used  as  a subroutine 
in  the  example  described  below  . It  is  known  as  the  "CDC-6600  Subroutine 
MATRIX".  The  method  of  this  subroutine  is  basically  the  classical  Gauss-Jordan 

4 

Method.  Once  the  vector  A,  or  the  partial  regression  coefficients  (a^, , a ), 

are  determined,  then  b is  determined  from  Eq.  <91. 

Now,  the  values  of  the  transnormalized  predictors,  Xj, , xn  and  predictand 

y may  be  given.  Alternatively,  each  x,  may  need  to  be  found  from  the  given 
probability  of  Xj  being  equalled  or  exceeded.  W here  P(x)  p. 


Dixon,  W.J.  (1965)  BMD  Biomedical  Computer  Programs,  I'niversity  of 
California  at  Los  Angeles,  Health  Sciences  Facility,  Los  Angeles. 


the  value  of  x can  be  approximated  to  a high  degree  of  accuracy  by  a convenient 
5 

algorithm: 


x = k ft  i — 

. 1 + 0 


30753  + 0.  2706  It 
99229t  + 0.  0448 


?] 


(15) 


where  

k = 1,  t =\tn{l /p2)  for  p < 0.  5,  and 


k = -1,  t = V*nl  1/(1  -p)2  l for  p > 0.  5. 


Once  xt  and  yc  are  known,  then  nc  is  determined  by  Eq.  (11),  and  the  condi- 
tional probability  follows  by  Eq.  (10).  Again  a useful  approximation  is  provided 
by  an  algorithm:'’ 


Pile* 


f + m 


£2^1  + cx  rj 


c + C2nc2  + C3nc3+C4 


4\41  _1 

nc  ; J 


(16) 


where 


cx  = 0.  196854, 

c2  = 0.  115194, 

c3  = 0.000344, 

c.  = 0.  019527, 

4 

1 = 0,  m = 1 for  rj^O,  and 

I = 1,  m = -1  for  t)  < 0. 


t.  SAMPLE  APPLICATION 

For  the  following  illustrative  application,  some  readily  available  data  were 
chosen  from  another  recent  study.  Records  of  WSR-57  radar  pictures  at  five 
stations  in  the  eastern  half  of  the  United  States,  in  the  midseasonal  months  of 
January,  April,  July  and  October,  taken  every  3 hours,  that  is,  eight  times  per 
day,  were  used.  The  five  stations  were:  (1)  Minneapolis,  Minnesota,  (2)  Key 
West,  Florida,  (3)  Wichita,  Kansas,  (4)  Cape  Hatteras,  North  Carolina, 

(5)  Evansville,  Indiana. 


5.  National  Bureau  of  Standards  (1964)  Handbook  of  Mathematical  Functions 
with  Formulas,  Graphs  and  Mathematical  Tables,  Applied  Mathematics 
Series  No.  55,  for  sale  by  Supt.  of  Documents,  U,  S.  Govt.  Printing  Office, 
Washington,  D.  C. 
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I Figure  I id  an  example  of  a radar  picture.  The  observer  usually  sees  the 

negative  of  pictures  like  Figure  1,  so  that  radar  echoes  from  rainstorms  appear 
as  white  clouds  on  a black  background  surrounding  the  radar  station.  In  Figure  1, 
the  outermost  circle  lias  a radius  of  125  nmi,  and  the  second  circle  100  nmi. 

A large  storm  is  shown  centered  approximately  85  nmi  northeast  of  the  radar 
station. 

1. 1 llalii  I olh'i  lioll 

To  study  each  picture,  a transparent  template  was  superimposed  over  it  — as 
shown  in  Figure  1 — to  divide  the  area  between  the  25-nmi  circle  and  the  100-nmi 


circle  into  04  cells,  each  of  400  nmi“.  Kach  of  the  04  cells  was  examined  to 
estimate  what  fraction  of  the  cell,  in  eights  or  oktas'1  was  covered  by  precipitation 
echo.  The  coverage  was  recorded  as  zero  for  no  coverage,  one  okta  for  1 8 

coverage,  , eight  oktas  for  full  coverage  of  the  cell.  For  this  exercise  the 

cells  were  grouped  into  eight  equal-area  groups,  each  group  consisting  of  a ring  or 
part  of  a ring,  and  numbered  1 to  8 (Figure  2).  A record  of  the  number  of  oktas 
(from  0 to  84)  in  each  group  was  made  for  each  l-hourly  picture.  As  an  example, 
in  Figure  1,  the  number  of  oktas  in  Groups  1 to  8 were  estimated,  by  eye.  as  2, 

8,  8,  1,  17,  8,  3,  0,  respectively. 


Figure  1.  Sample  of  a PPI  Radar-scope  Figure  2.  Illustration  of  the  Eight  Equal 

Picture  of  Storms  Within  125  nmi  of  Areas,  Each  Obtained  by  the  Grouping 

the  Radar  Station  of  Eight  Cells  (Figure  1) 

8.  McIntosh,  D.  H.  (1972)  Meteorological  Glossary,  Chemical  Publishing,  New 
York. 
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Table  1 shows  an  example  of  the  entries  of  the  okta  counts  in  groups  1 to  8. 
The  okta  count  in  group  1 was  chosen  as  the  predictand.  The  okta  counts  in 
groups  2 to  8 were  chosen  as  the  seven  predictors,  and  were  matched  with  okta 
counts  in  group  1 at  time  lags  of  zero  (for  simultaneous  coverage),  3 hours, 

6 hours,  9 hours,  and  12  hours  later.  Thus,  in  the  example  of  Table  1,  the  num- 
bers on  the  first  line  (17,  18,  35,  4,  26,  30,  9)  were  tested  as  predictors  of  the 
okta  count  (11)  on  the  first  line,  then  of  the  okta  count  (40)  on  the  second  line, 
then  (12)  on  the  third  line,  then  (7)  on  the  fourth  line,  and  lastly  (2)  on  the  fifth 
line. 


Table  1.  Sample  Data  Collection  of  Radar  Coverage  of  Precipitation.  The 
example  is  for  Minneapolis,  Minnesota,  26  October  1970.  The  entries  are 
the  okta  counts 


Hour 

Group 

1 

2 

3 

4 

5 

6 

7 

8 

0100 

1 1 

17 

18 

35 

4 

26 

30 

9 

0400 

40 

31 

35 

9 

12 

18 

7 

8 

0700 

12 

13 

9 

9 

30 

3 

1 

8 

1000 

7 

7 

8 

8 

19 

9 

3 

18 

1300 

2 

4 

17 

0 

10 

3 

0 

0 

1600 

0 

0 

0 

0 

4 

1 

0 

0 

1900 

0 

0 

0 

0 

0 

0 

0 

0 

2200 

0 

0 

0 

0 

0 

0 

1 

0 

For  each  station  the  number  of  radar  pictures,  in  one  midseasonal  month,  was 
equal  to  the  number  of  years  (6)  times  the  number  of  days  per  month  (30  to  31) 
times  the  number  of  3-hourly  periods  per  day  (9)  for  a total  of  1440  or  1488  pic- 
tures, less  a generally  small  number  of  pictures  missing  from  the  record.  Most 
of  the  pictures,  by  far,  showed  no  precipitation  echoes,  revealing  that  there  was 
little  or  no  precipitation  on  most  days. 


1.2  f inding  the  Correlations 

For  each  station  and  midseason  month,  a computer  program  yielded  the 
following: 

(1)  The  sample  size  (<  1488). 

(2)  The  number  of  missing  observations,  which  together  with  the  sample  size 
added  up  to  1440  or  1488. 

(3)  The  sample  size,  in  which  the  seven  predictors  (groups  2 to  8)  of  one 
picture  could  be  matched  to  the  predictand  (group  1)  at  lag  times  0,  3,  6,  9,  or  12 
hours  later  . 
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(4)  A table  of  the  frequency  of  the  areal  coverage  in  each  group  tl  to  8) 

starting  with  full  coverage  (ii4  oktas),  then  63  oktas, 2 oktas,  1 okta,  an 

lastly  none  (for  example.  Table  2). 


Table  2.  The  Frequency  of  Fractional  Cover,  in  Oktas.  The  coverage  is  by 
radar  echoes  as  in  Figure  1.  The  case  is  for  Minneapolis,  Minnesota, 
January  1969-1974. 


Okta 

Count 

<L) 


Total 


(5)  A table  of  the  cumulative  relative  frequency  of  coverage  equal  to  or 
greater  than  L oktas  in  each  group,  for  L 64,  63, , 1,  0 (for  example, 

n 

Table  3).  The  Blom  formula  was  used  to  calculate  relative  frequency. 


Table  3.  The  Cumulative  Relative  Frequency  of  Coverage  Equal  to  or  Greater 
Than  E Oktas  The  example  is  for  Minneapolis,  Minnesota,  January 


Okta 

Count 

(L) 


*This  relative  frequency  is  obtained  by  adding  half  of  the  frequency  of  no  echo  to 
the  total  number  of  cases  with  echoes,  and  dividing  by  the  overall  sample  size. 


Blom,  Gunnar  (1958)  Statistical  Estimates  and  Transformed  Beta -Variables 
John  VVilev  and  Sons,  New  York. 


Group  (i) 

1 

2 

3 

4 

5 

6 

7 

8 

0.0004 

_ 

_ 

_ 

_ 

_ 

_ 

0.  0004 

- 

- 

- 

- 

- 

- 

- 

0.  0018 

0.0004 

- 

- 

- 

- 

- 

- 

0.  115 

0.  095 

0.  047 

0.  046 

0.  0074 

0. 0102 

0.  0074 

0. 0088 

0.  122 

0.  103 

0.  056 

0.  057 

0.  0095 

0. 0137 

0. 0137 

0. 0116 

0.  132 

0.  115 

0.  0761 

0. 0691 

0.  0137 

0. 0193 

0.0179 

0.  0193 

0.  148 

0.  138 

0. 0978 

0. 0908 

0.  0228 

0.  0277 

0. 0305 

0.  0263 

0.  575 

0.  5(>9 

0.  549 

0.  54  f> 

0.  512 

0.  514 

0.  515 

0.  514 

1 


(6)  A table  of  the  transnonnalized  variable  (xj)  corresponding  to  the  okta 
count  (L)  in  the  ith  group.  This  was  obtained  by  substituting  relative  frequency 
for  the  probability  (p)  in  Eq.  (15)  (for  example.  Table  4). 


Table  4.  The  Values  of  the  Transnonnalized  Variables  Corresponding  to  th  • 
Relative  Frequencies  of  Table  3 


Okta 
C ount 

Group  (i) 

1 

2 

3 

4 

5 

6 

7 

8 

(L) 

64 

3.33 

_ 

. 

_ 

. 

_ 

- 

63 

3.  33 

- 

- 

- 

- 

- 

- 

- 

62 

2.91 

3.33 

“ 

- 

- 

- 

4 

1.  20 

1.31 

1. 68 

1. 68 

2.  44 

2.  32 

2.  44 

2.  37 

3 

1.  16 

1.  26 

1.  .38 

1.  58 

2.  34 

2.  21 

2.  21 

2.  27 

2 

1.  12 

1.20 

1.43 

1. 48 

2.  21 

2.07 

2.  10 

2.  07 

i 

'.04 

1. 09 

1.  29 

1.34 

2.  00 

1.  92 

1.  87 

1.  94 

0 

-0.  19 

-0.  17 

-0.  12 

-0.  1 1 

-0.03 

-0.  04 

-0.  04 

-0.  03 

(7)  The  matrix  of  the  correlation  coefficients  (p.^)  between  the  predictors 
(for  example.  Table  5). 


Table  5.  The  Cross-Correlation  Coefficients  (p . ) Between  t he  Trans- 
normalized  Values  of  the  Fractional  Cover  or  Okra  Count  in  Groups 
1 to  8.  The  example  is  for  Minneapolis,  Minnesota,  January 


Group 

(j> 

Group  (i) 

1 

2 

3 

4 

5 

h 

7 

8 

1 

1. 00 

0.  90 

0.  72 

0.  68 

0.  32 

0.  44 

0.  43 

0.  29 

2 

1. 00 

0.  81 

0.  77 

0.  39 

0.  49 

0.49 

0.  36 

3 

1. 00 

0.  63 

0.  51 

0.  59 

0.  42 

0.  37 

4 

1. 00 

0.  38 

0.41 

0.  62 

0.  54 

5 

1. 00 

0.  28 

0.  24 

0.  51 

6 

1. 00 

0.  44 

0.  21 

7 

1. 00 

0.  26 

8 

1.00 

1 

(8)  A matrix  of  correlation  coefficients,  each  row  of  which  consisted  of  the 
cc  (p^)  between  the  predictand  (v)  and  each  predictor  Xj  (i  2,  8).  Table  (i  is  an 
example.  In  each  successive  row,  the  predictand  is  the  transnonnalized  value  of 
the  okta  count  at  time  0,  3,  (i,  9,  12  hours  later. 


A 


lfi 


Table  6.  The  cc  of  the  Predmtand  With  Each  Predictor.  The  First  Line  is  for 
the  group  1 predictand  occurring  at  the  same  time  as  the  predictors.  The 

second  line  is  for  the  predictand  occurring  3 hr  later, , the  fifth  line  is  for 

the  predictand  occurring  12  hr  later.  The  column  headed  "1"  gives  the 
incidental  information  of  the  correlations  of  group  1 of  one  time  with  group 
1 of  0,  3,  6,  9,  12  hr  later 


Predictand 
Group  1 


0 hours 
3 
6 
9 

12 


Predictor  Group  (i) 


1.3  Finding  the  Partial  Regression  Coefficients 

Given  the  p^  and  p-  for  one  station,  one  month  and  a time  lag,  the  computer 

solution  of  Eqs.  (8)  and  (9)  yielded  values  for  ag, , ag  and  b.  Table  7 shows 

the  set  of  values  of  aj  (i  = 2, , 8)  and  b for  time  lags  0 to  J 2 hours,  for  Key 

West,  Florida,  in  January.  It  demonsti utes  decreasing  importance,  with  time,  of 
group  2 as  a predictor  of  group  1 and  the  varying  importance  of  the  other  predictors. 
Group  8,  which  is  situated  northwest  of  the  radar  station  (Figure  2)  is  a better 
predictor  of  the  weather  of  group  1 than  group  2 for  tjirne  lags  6,  9 and  12  hours. 

This  is  easily  understood  since  the  weather  tends  to  ^nigrate  eastward. 


Table  7.  The  Partial  Regression  Coefficient  of  Eq.  (11)  as  Determined  for 
Key  West,  Florida,  January,  for  Time  Lags  0,  3,  6,  9,  12  hr  Between 
Predictors  and  Predictand 


Time 

Lag 

(Hours) 

a2 

a3 

a4 

a5 

a6 

a7 

a8 

b 

0 

0.751 

0.  071 

-0. 007 

0.  079 

-0. 023 

0.  031 

0.  033 

0.  474 

3 

0.  31G 

0.  139 

0.  136 

-0.  038 

0.  004 

0.  120 

0.  197 

0.  675 

G 

0.  118 

0.  076 

0.  102 

-0.  029 

0.  082 

0.  125 

0.  263 

0.  809 

9 

0.  010 

0.053 

0.  049 

0.  000 

0.  110 

0.  159 

0.  219 

0.  881 

12 

0.  034 

0.  04G 

-0.  001 

-0.  004 

0.  100 

0.  133 

0.  159 

0.  931 

This  last  consideration  prompts  an  examination  of  the  coefficients  at  one 
station  (Key  West)  to  compare  their  values  from  season  to  season  (Table  8).  For 
Key  West,  Florida,  at  a time  lag  of  G hours,  they  show  a greater  importance,  in 
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Table  8.  The  Coefficients  Divided  by  b.  Determined  for  Key  West,  Florida 
for  Time  Lag  of  6 hrs,  in  January,  April,  July,  and  October 


Midseason 

Month 

(aj/b) 

i = 2 

3 

4 

5 

6 

7 

8 

January 

m 

0.  13 

-0.04 

0.  10 

0.  15 

0.32 

April 

in 

0.09 

0.  15 

0.  14 

-0.  06 

0.  12 

0.  17 

0.  832 

July 

0.  13 

0.02 

0.  13 

0.  11 

0.  15 

-0.  03 

0.  19 

0.887 

October 

0.  13 

0.  18 

0.  17 

0.  09 

0.  12 

0.  24 

0.  11 

0.  770 


winter  of  the  condition  55  to  73  miles  to  the  northwest  (group  8)  than  the  weather 
closer  to  the  area  of  verification  (group  2).  This  is  true  in  summer  also,  although 
not  as  spectacularly.  The  month  of  October  offers  the  best  opportunity  for  pre- 
diction by  conditional  probability.  In  the  fall  there  must  be  a more  prevalent 
southwesterly  flow  (group  7)  than  northwesterly. 

(.  i Estimating  Conditional  Probability 

Given  coefficients  83, , ag  and  b,  the  model  of  Eqs.  (11)  and  (10)  yields 

probability  estimates.  For  example,  on  3 January  1974,  at  0100,  at  Key  West, 
Florida,  the  seven  predictor  values,  in  oktas,  were  observed  to  be  2,  0,  2,  0,  0, 

1,  1 oktas,  which  are  transnormalized  into  x = 1.  219,  -0.453,  1.223,  -0.255, 

-0.371,  0.924,  1.449,  respectively.  The  predictand  values,  at  time  lags  3,  6,  9, 

12  hours  later  were  1,  18,  25  and  20  oktas  respectively,  which  had  (unconditional) 
frequencies  of  being  equalled  or  exceeded:  0.  1G,  0.026,  0.017,  0.022  respectively, 
and  therefore  were  transnormalized  into  y0  0.977,  1.937,  2.  122  and  2.006, 
respectively.  Inserting  these  transnormalized  values  and  the  coefficmr'fs, 
a}  ( i = 2,  8)  and  b (Table  7)  into  Eq.  (11),  the  values  of  qc  become  0.  124,  1.  52, 
1.865,  1.  791  and  consequently  the  conditional  probabilities,  through  Eq.  (16),  are 
P (rjc)  = 0.451,  0.064,  0.031,  0.036,  respectively.  As  to  be  expected,  the  increase 
of  the  conditional  probabilities- over  the  unconditional  probability  is  demonstrated 
best  on  the  3 -hour  lag. 

A priori,  Eqs.  (11)  and  (10)  are  applied  to  give  the  conditional  probability  that 
certain  thresholds  of  areal  coverage  are  exceeded.  As  an  example,  if  the  problem 
is  to  give  the  probability  that  group  1 will  have  8 oktas  or  more  coverage,  of  which 
the  unconditional  probability  is  0.  0413,  then  in  the  above  example  for  Key  West, 

3 January  1974  the  model  estimates  of  conditional  probabilities  for  3-,  6-,  9-, 
12-hour  lags  are  0.  106,  0.  102,  0.077,  0.066,  respectively.  These  are  all  improved 
probabilities  over  the  unconditional  probability.  In  verification,  there  was  an  echo 
on  all  except  the  3 -hour  lag. 
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A computer  program  was  written  to  calculate  the  conditional  probabilities  of 
the  fractional  cover  in  group  1,  successively  equal  to  or  greater  than  8 oktas. 


16  oktas,  32  oktas,  and  lastly  64  oktas  (complete  coverage),  given  the  number  of 
oktas  in  groups  2 to  8 at  the  initial  hour.  This  was  done  for  time  lags  of  0,  3,  6, 
9,  12  hours,  at  each  of  the  five  stations  for  each  midseasonal  month  and  for  each 
hour  for  which  there  was  a recorded  verification. 

The  following  illustration  is  for  Minneapolis  in  October.  The  climatic  fre- 
quency of  the  predictand,  equal  to  or  exceeding  the  fractional  cover,  was: 


Full  cover  (64  oktas) 

0.  0011 

Half  cover  (32  oktas) 

9.  0340 

One-quarter  cover  (16) 

0.  0764 

One-eighth  cover  (8) 

0.  1236 

Average 

0.  059 

The  total  nv.nber  of  " forecasts",  always  less  than  the  maximum  possible 
number  (4  x 1488),  is  shown  in  the  second  column  (Table  9).  The  number  of  times 
that  the  conditional  probability  was  computed  to  be:  0.  00  to  0.  099,  0.  100  to 

0.  199, , 0.900  to  0.999  is  shown  in  th~'  next  10  columns.  All  eleven  pieces  of 

information  are  shmvn  for  the  forecasts  of  time  lags  0,  3,  6,  9,  12  hours. 


Table  9.  The  Number  of  Times  That  the  Conditional  Probability  of 
Fractional  Coverage  of  1/8,  1 1/2  or  Full  Coverage  was 

Estimated  in  the  Range  0.  0 to  0.  09%  , 0.  1 to  0.  19%  , . The 

example  is  for  Minneapolis,  Minnesota,  October 


T ime 
Lag 

(Hours) 

Total 

No. 

Fcsts 

Conditional  Probability  Range 

0 

to 

0.  099 

0.  1 
to 

0.  199 

0.  2 
to 

0.  299 

0.  3 
to 

0.  399 

0.4 

to 

0.  499 

0 

5844 

5247 

123 

8 7 

51 

52 

3 

5780 

5169 

169 

124 

66 

73 

6 

5752 

5078 

312 

137 

101 

61 

9 

5724 

5049 

395 

164 

79 

31 

12 

5fi92 

5071 

413 

152 

47 

9 

0.  5 

0.  6 

0.  7 

0.  8 

0.9 

to 

to 

to 

to 

to 

0.  599 

0.  699 

0.  799 

0.  899 

0.  999 

0 

5844 

50 

42 

34 

51 

107 

3 

5780 

49 

52 

34 

28 

16 

6 

5752 

3 6 

21 

6 

0 

0 

9 

5724 

6 

0 

0 

0 

0 

12 

5692 

0 

0 

0 

0 

0 

19 
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Viewed  as  forecasts,  the  conditional  probability  estimates,  by  definition,  are 
sharp  when  they  are  close  to  1.  0 or  0.  0,  implying  a high  degree  of  confidence  by 
the  forecaster.  In  actual  fact,  there  are  many  low  estimates,  simply  because  the 
echo-producing  rainstorms  are  so  infrequent.  But  there  are  relatively  few  high 
conditional  probabilities.  Table  10  presents  the  overall  relative  frequency  of 
prediction,  for  all  five  stations  and  all  four  midseasonal  months.  An  estimate  of 
0.  70  or  higher  was  never  made  beyond  the  3 -hour  lag.  Even  on  the  zero  lag,  the 
degree  of  certainty  of  radar  echoes  in  group  1 — when  there  are  echoes  in  groups 
2 to  8 — is  not  generally  high. 


Table  10.  The  Frequency  (%  ) of  the  Estimates  of  Conditional  Probability  of 
Fractional  Coverage  (Including  8,  16,  32  and  64  Oktas)  Among  the  Cases 
When  the  Conditional  Prov  ability  was  Estimated  >0.  10.  The  figures  are 
averages  for  all  five  stations  and  midseasonal  months 


The  sharpness  of  the  conditional  probabilities,  as  a forecast  tool,  was  mea- 
sured by  a statistic  related  to  Brier's  score8  as  described  below.  I.et 

(1)  P(L)  P (n  rjc),  the  conditional  probability  that  the  areal  coverage 

(number  of  oktas)  in  group  1 will  equal  or  exceed  the  number  L ( 8,  16,  32,  64); 

(2)  F(L)  Climatic  frequency  or  'he  unconditional  probability  that  the 
number  of  oktas  in  group  l will  equal  or  exceed  the  number  L; 

(3)  V 1 when,  on  the  hour  of  verification,  the  number  of  oktas  equals  or 
exceeds  the  number  L:  and 

(4)  V 0 when,  on  the  hour  of  verification,  the  number  of  oktas  is  less  than 
the  number  L. 


8.  Brier,  G.W.,  and  Allen,  R.A.  (1951)  Verification  of  weather  forecasts, 

Compendium  of  Meteorology,  American  Meteorological  Society,  Boston, 
MA.,  925-996. 


T ime 

Conditional  Probability  Estimate: 

Lag 

(Hours) 

0.  10 

iC.  20 

>0.  30 

•0.  40 

0.  50 

>0.  60 

■0.  70 

>0.  80 

>0.  90 

0 

100% 

72% 

53% 

43% 

34% 

20% 

20% 

14% 

9% 

3 

100% 

57% 

37% 

23% 

14% 

9% 

4% 

2% 

1% 

6 

100% 

41% 

19% 

8% 

4% 

1"' 

- 

- 

- 

9 

100% 

26% 

8%, 

2% 

1% 

- 

- 

- 

- 

12 

100% 

10% 

2% 

- 

- 

- 

- 

- 

- 

For  this  test  of  verification  and  scoring,  L was  made  successively  equal  to  8,  16, 
32,  and  64  corresponding  to  fractional  coverages  of  1/8,  1/4,  1/2  and  full  cover- 
age, respectively. 

In  the  Brier  scoring  system,  if  the  estimated  conditional  probabilities  are 
correct,  or  perfectly  valid,  then  the  essence  of  the  test  is  in  the  sharpness  of  the 
probability,  P(L).  If  P(L)  is  greater  than  the  climatic  frequency,  F(L),  then  the 
difference  squared, 

Ef  = l P(U  - V l2 
would  be  smaller  than 

Eg  - l F(L)  - V \ 2 (18 

when  V I denoting  occurrence.  Conversely,  if  P(L)  is  smaller,  than  F(L), 
implying  lowered  confidence  in  the  occurrence  of  the  event  (L),  then  Ef  would  be 
smaller  when  the  event  does  not  occur. 

For  all  forecasts,  totalled  over  all  probability  estimates,  over  all  lags  and 
for  all  times  when  the  forecast  is  made,  the  total 

T = S [ P(D  - V |2 

is  intended  to  be  as  small  as  possible.  For  completely  sharp  probability  state- 
ments P(I.)  is  either  equal  to  1 or  0,  and  if  completely  matched  to  the  values  of  V, 
then  T would  be  identically  zero. 

7 

Skill,  as  apparently  first  propos'd  by  Heidke,  is  judged  by  the  advantage  of 
the  forecasts  over  the  information  in  the  climatic  frequencies.  If  the  symbol  C is 
used,  instead  of  T,  for  the  total  when  climatic  frequencies  are  used  as  probabili- 
ties, then 

C ■ Z | F(L)  - V )2  . 

The  skill  (S)  is  ultimately  judged  by 

S = (C  - T)/C  . 

The  statistic  S has  the  property  that  it  will  be  equal  to  1.  0 for  perfectly  sharp  and 
accurate  forecasts,  and  will  be  equal  to  0.  0 for  conditional  probability  statements 
that  do  not  differ  profitably  from  the  unconditional  probabilities.  It  is  possible 
for  forecast  probabilities  to  be  so  much  in  error  that  T will  be  greater  than  C,  to 


make  S negative.  Generally  speaking,  the  conditional  probability  will  be  greater 
than  the  unconditional  probability  when  there  is  an  improved  chance  of  the  event 
occurring,  and  smaller  for  a reduced  chance.  Hence,  the  term  Ef  will  generally 
be  less  than  the  corresponding  Ec.  All  told,  the  score  S is  best  when  it  is  closest 
to  1.0.  A value  of  0.0  for  S would  imply  that  the  conditional  probabilities  offer  no 
information  over  that  of  the  unconditional  probabilities.  Table  11  lists  all  of  the 
skill  scores  for  each  of  the  five  stations  by  midseasonal  month  and  lag.  As  to  be 
expected,  the  score  decreases  with  time  lag.  At  zero  lag,  or  for  simultaneous 
occurrences,  the  fractional  covers  over  groups  2 to  8 collectively  must  be  consi- 
dered as  fair  to  good  estimates  of  the  coverage  over  group  1.  But,  in  3 hours  the 
predictability  decreases  rapidly.  In  12  hours  it  almost  vanishes,  and  in  fact  does 
at  Cape  Hatteras,  North  Carolina. 


Table  11.  The  Skill  Scores  Which  Range  From  0 for  "No  Skill"  to  1.0  for 
Perfectly  Correct  Forecasts.  The  stations  are  (1)  Minneapolis,  Minnesota, 
(2)  Key  W'est,  Florida,  (3)  Wichita,  Kansas,  (4)  Cape  Hatteras,  North 
Carolina,  and  (5)  Evansville,  Indiana 


Time 

Lag 

(Hours) 


0 

0.  G3 

0.  65 

0.  66 

0.  76 

0.  73 

0.  71 

0.  54 

0.  58 

0.  65 

0.  70 

3 

0.  35 

0.40 

0.43 

0.  50 

0.  40 

0.  41 

0.  33 

0.  32 

0.  40 

0.  35 

6 

0.  14 

0.  19 

0.  20 

0.31 

0.  25 

0.  20 

0.  11 

0.  13 

0.  18 

0.  11 

9 

0.  07 

0.  08 

0.  10 

0.  14 

0.  14 

0.  11 

0.  06 

0.  03 

0.  10 

0.  05 

12 

0.  03 

0.  03 

0.04 

0.00 

0.  08 

0.  07 

0.  02 

0.  02 

0.  00 

0.  03 

July 

October 

0 

0.  64 

0.  42 

0.  53 

0.  56 

0.  52 

0.  71 

0.  55 

0.  60 

0.  59 

0.  72 

3 

0.  32 

0.  10 

0.  27 

0.  27 

0.  27 

0.44 

0.  32 

0.  35 

0.  32 

0.48 

6 

0.  06 

0.  02 

0.  09 

0.  10 

0.  08 

0.  23 

0.  20 

0.  17 

0.  17 

0.  27 

9 

0.00 

0.  01 

0.  03 

0.  05 

0.  04 

0.  12 

0.  13 

0.04 

0.  10 

0.  12 

12 

0.  00 

0.  04 

0.  01 

0.00 

0.02 

J 

0.  08 

0.  06 

0.  02 

0.  00 

0.  07 

SI  MM  ARY  AM)  CONCLUSIONS 


The  conditional  probability  of  Y equal  to  or  greater  than  a threshold  Yc,  given 

several  or  many  predictors  Xj, , Xn,  is  determined  from  the  transnormalized 

variables.  Symbolically 
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P(Y)  > Yc  | Xv  — . Xn)  - P(y  > yc  I xv  — , *n) 

= P (n  > nc ) 

where  yc>  Xj, , x are  the  transnormalized  variables  obtained  from  their  res- 

pective climatic  probabilities  (p)  through  Eq.  (14)  or  its  approximation  Eq.  (15). 

The  value  of  nc  is  obtained  through  Eq.  (11).  If  the  partial  regression 

coefficients  a.  ( i = 1, , n)  and  their  function  b are  not  provided,  they  are 

determinable  in  terms  of  the  cross-correlation  coefficients  between  the  predictors 
(p.^)  and  the  cc's  between  predictand  and  predictors  (p.)  through  Eqs,  (d)  and  (9). 
The  approximation  for  the  conditional  probability,  good  to  four  decimals,  is  given 
by  Eq.  (16). 

The  model  was  tested  on  some  readily  available  data,  consisting  of  radar- 
echo  coverage  around  each  of  5 stations  in  the  eastern  half  of  the  I’nited  States. 

The  tests  herein  for  verification  provided  no  measure  of  the  degree  of  validity  of 
the  conditional  probability  estimates.  But  there  is  strong  evidence,  in  Table  11, 
that  they  are  satisfactorily  valid,  because  the  skill  scores  fall  to  zero  but  do  not 
fall  below  zero. 

Assuming  validity,  the  Brier  scoring  system  provides  a measure  of  sharpness. 
Table  10  displays  a low  measure,  in  general.  But  the  model  is  capable  of  estima- 
ting high  probabilities  as  evidenced  by  the  frequencies  in  the  first  two  lines  of 
Table  10. 
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